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Abstract 

We study the multiway cut problem in directed graphs and one of its special cases, the node-weighted 
multiway cut problem in undirected graphs. In Directed Multiway Cut (Dir-MC) the input is an 
edge-weighted directed graph G = (V, E ) and a set of k terminal nodes {si, S 2 ,..., sr} C V; the goal is to 
find a min-weight subset of edges whose removal ensures that there is no path from ,s, to sj for any i f j. 
In Node-weighted Multiway Cut (Node-wt-MC) the input is a node-weighted undirected graph G 
and a set of k terminal nodes {si, S 2 ,..., Sk} C V; the goal is to remove a min-weight subset of nodes 
to disconnect each pair of terminals. Dir-MC admits a 2-approximation |[26l and NODE-WT-MC admits a 
2(1 — ^-approximation ||T9l , both via rounding of LP relaxations. Previous rounding algorithms for these 
problems, from nearly twenty years ago, are based on careful rounding of an optimum solution to an LP 
relaxation. This is particularly true for Dir-MC for which the rounding relies on a custom LP formulation 
instead of the natural distance based LP relaxation ll26l . 

In this paper we describe extremely simple and near linear-time rounding algorithms for Dir-MC and 
Node-WT-MC via a natural distance based LP relaxation. The dual of this relaxation is a special case of the 
maximum multicommodity flow problem. Our algorithms achieve the same bounds as before but have the 
significant advantage in that they can work with any feasible solution to the relaxation. Consequently, in ad¬ 
dition to obtaining “book” proofs of LP rounding for these two basic problems, we also obtain significantly 
faster approximation algorithms by taking advantage of known algorithms for computing near-optimal so¬ 
lutions for maximum multicommodity flow problems. We also investigate lower bounds for Dir-MC when 
k = 2 and in particular prove that the integrality gap of the LP relaxation is 2 even in directed planar graphs. 


*Dept. of Computer Science, University of Illinois, Urbana, IL 61801. 

chekurigillinois.edu 

^Dept. of Computer Science, University of Illinois, Urbana, IL 61801. 

vmadan2@illinois.edu 


Supported in part by 
Supported in part by 


NSF grant CCF-1319376. 
NSF grant CCF-1319376. 



1 Introduction 


We study several variants of the multiway cut problem in graphs (also referred to as the mult-terminal cut 
problem). In the classical s-t cut problem the input consists of a graph G = ( V. E ) and two distinct nodes s, t; 
the goal is to separate s from t by removing a minimum cost set of edges and/or nodes. In the multiway cut 
problem the input is a graph G = (1/, E) and a set S = {si, s 2 , ■ ■ ■, Sfc} of k nodes from V called terminals; 
the goal is to separate the ter mi nals from each other at minimum cost by removing edges and/or nodes. We 
describe the three main variants that are of interest to us. 

Multiway Cut (Edge-wt-MC): The input is an undirected graph G = (V, E) along with non-negative edge 
weights w(e),e E E and a set {si,..., s^} C V of terminals. The goal is to find a min-cost set of edges 
E' C E such that in G — E' there is no path from s,; to Sj for i f j. 

Node-Weighted Multiway Cut (Node-wt-MC): The input is an undirected graph G = (V,E) along 
with non-negative node weights w(v),v E V and a set {si,..., Sfc} C V of terminals. The goal is to find a 
min-cost set of nodes V' C V such that in C — V there is no path from .s, to Sj for i f 

Directed Multiway Cut (Dir-MC): The input is a directed graph G = ( V, E) along with non-negative 
edge weights w(e),e E E and a set {si,..., C V of terminals. The goal is to find a min-cost set of edges 
E' C E such that in G — E' there is no path from s t to Sj for i j. 

Remark 1.1. DlR-MC with k = 2 is not the same as the s-t cut problem. The goal is to separate s 1 from s 2 
and S 2 from si. In fact Dir-MC with k = 2 is NP-Hard II7\I . 

The complexity of the multiway cut problem and its variants have been extensively studied since the paper 
of Dahlhaus et al. fill . They showed that Edge-wt-MC with k = 3 is NP-Hard; it was later observed that 
the problem is also APX-hard to approximate. This is in contrast to the case of k = 2 which can be solved in 
polynomial-time in undirected graphs via a reduction to the s-t minimum-cut problem. 

Edge-wt-MC reduces in an approximation preserving fashion to Node-wt-MC which in turn reduces 
in an approximation preserving fashion to Dir-MC lfl9l : it is also easy to see that in the directed case, node- 
weighted and edge-weighted versions are equivalent. The current best approximation ratio for Edge-wt-MC 
stands at 1.2965 due to Sharma and Vondrak (281 . For Node-wt-MC a 2(1 — 1 /k) approximation is known 
from the work of Garg, Vazirani and Yannakakis lfl9ll . and for Dir-MC a 2 approximation is known from the 
work of Naor and Zosin (26]]. Vertex Cover reduces to Node-wt-MC and Dir-MC in an approximation 
preserving fashion fl9l . Assuming P f NP Vertex Cover is hard to approximate to within a factor of 1.36 
l lT2l . and assuming the Unique Games Conjecture it is hard to approximate to within a factor of (2 — e) for any 
fixed e > 0 (23Tl . These hardness results apply to Node-wt-MC and Dir-MC and show that Edge-wt-MC 
is provably easier to approximate than them. 

Our focus in this paper is on approximation algorithms for Node-wt-MC and Dir-MC. The known al¬ 
gorithms are based on rounding suitable LP relaxations for the problems. For both problems there is a simple 
and natural LP relaxation based on distance variables on nodes/edges; see Section [2] and [3] (We note that a 
similar relaxation applies to the more general Multicut problem and that dual of the LP relaxation corre¬ 
sponds to the LP for maximum multicommodity flow.) For Node-wt-MC the algorithm of Garg, Vazirani and 
Yannakakis Ifl9l shows that any optimum solution to the relaxation can be converted to a half-integral optimum 
solution which can then be rounded easily. The situation for Dir-MC is much more involved. Unlike the case 
of Node-wt-MC, half-integral optimum solutions may not exist for the relaxation even for k = 2. Garg et 
al. ffl7l obtained an 0(log /^-approximation via the relaxation using ideas from approximation algorithms for 
multicut fniil . Naor and Zosin obtained a 2-approximation for Dir-MC in an elegant, surprising and somewhat 

1 In this definition terminals are allowed to be removed. If they are not allowed to he removed we can simply make their weight 00 . 
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mysterious fashion. They write a different LP relaxation called the relaxed multiway flow relaxation which is 
within a factor of 2 of the natural relaxation, and show that an optimum solution to this new relaxation can be 
rounded without any loss in the approximation. This gives an indirect proof that the natural relaxation has an 
integrality gap of at most 2. The proof of correctness crucially relies on complementary slackness properties of 
the optimum solution and is partly inspired by the ideas in lfl9l . The idea of using a relaxed multiway flow is 
inspired by earlier work on the subset feedback vertex problem lfl4l . 

The algorithms of lfl9l and ll26l are from almost twenty years ago. During this intervening years no alterna¬ 
tive algorithms or rounding schemes have been obtained for these basic problems. We observe that for the case 
of Edge-wt-MC there is an extremely simple rounding scheme that converts any fractional feasible solution 
to a multiway cut with a loss of a factor of 2 (see Il29l ). The algorithm picks a random 0 e (0,1/2) and for each 
terminal s t removes the edges leaving the ball B(s t . 9) of nodes contained within a radius 9 around s t (with 
respect to distances given by the LP solution); more formally the output is Ui=i 9)). 

In this paper we show that very simple algorithms which are essentially similar in spirit to the above scheme 
also work for Dir-MC and Node-wt-MC! 

• The rounding algorithms are extremely simple and natural to describe, and in retrospect also to analyze. 

• The algorithms only require a feasible solution to the natural LP relaxation and not necessarily an opti¬ 
mum solution. 

• Given a feasible fractional solution, the rounding algorithms can be implemented in time that is similar 
to what is required for one single-source shortest path computation. The deterministic version requires 
an additional logarithmic factor. 

In addition to algorithmic results we also obtain some lower bound results for Dir-MC with k = 2; the 
goal is to separate s from t and t from s in a directed graph G; subsequently we refer to this special case as 
■s/-Bi-Cut. We prove that the natural LP relaxation has an integrality gap of 2 for sf-Bi-Cut even in planar 
directed graphs. 

We believe that our algorithms and analysis will be useful for related problems. Indeed one of our moti¬ 
vations for simplifying the rounding schemes for Dir-MC and Node-wt-MC came from attempts to obtain 
algorithms for a problem with applications to network information theory |[8j]. A significant consequence of 
our rounding algorithms are much faster approximation algorithms for Node-wt-MC and Dir-MC in both 
theory and practice. Solving the LP relaxations for Node-wt-MC and Dir-MC to optimality is quite chal¬ 
lenging. The options are to use the Ellipsoid method or to use a compact formulation with a very large number 
of variables and constraints. As we remarked earlier, the dual of the natural LP relaxation for these problems is 
the maximum multicommodity flow problem. Combinatorial fully-polynomial time approximation schemes for 
solving these multicommodity flow problems have been extensively investigated in theoretical computer sci¬ 
ence and mathematical programming with a number of techniques developed over the years; we refer the reader 
to ll27l i20l -30, 2] [161 fl5l l3l [24 1. Thus, a fast (1 + ^-approximation for the LP relaxation for Node-wt-MC 
and Dir-MC can be obtained using these methods. The fastest theoretical algorithms run in time 0(m 2 /e 2 ) 
mm or in even faster 0(mn/e 2 ) time l24l under some mild conditions; here rn is the number of edges and 
n is the number of nodes in G and O suppresses poly-logarithmic factors. Note that these running times are 
independent of k. Our rounding algorithms can convert such an approximate feasible solution to an integral cut 
in near-linear time with a factor of 2 loss in the cost. Thus, we can obtain provably fast (2 + e)-approximation 
algorithms. Since our focus is on the rounding algorithms we do not go into further details of specific algorithms 
or running times for solving the relaxation. 

We refer the interested reader to quickly jump to Section [2] to see the simplicity of the rounding scheme 
and its analysis for Dir-MC that achieves a bound of 2. This also applies to Node-wt-MC via a simple 
reduction to Dir-MC. We also discuss some new observations on the hardness of the problem when k = 2. In 
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Section[3]we give a slightly different rounding scheme for Node-wt-MC that achieves an improved bound of 
2(1 — 1/k), matching the known ratio from Ifl9ll . 

1.1 Other related work 

The natural LP relaxation for Edge-wt-MC has an integrality gap of 2(1 — 1/k). Approximation algorithms 
for Edge-wt-MC received substantial attention following the breakthrough work of Calinescu, Karloff and 
Rabani (5J. They developed a new “geometric” LP relaxation (henceforth referred to as the CKR-relaxation) 
which they used to obtain a (1.5 — 1 /A:) -approximation. The integrality gap of the CKR-relaxation, and con¬ 
sequently the approximation ratio, was improved subsequently to 1.3438 by Karger et al. lf22l . to 1.32388 by 
Buchbinder et al. 0J, and to the currently best known bound of 1.2965 by Sharma and Vondrak Il28ll . For 
k = 3 a tight bound of 12/11 is known EES). It is also known that assuming the Unique Games Conjec¬ 
ture, for any fixed k, the approximability threshold for Edge-wt-MC coincides with the integrality gap of the 
CKR-relaxation ll25l . 

The CKR-relaxation makes use of the observation that Edge-wt-MC can be viewed as a partition problem 
where the goal is to partition the node set V(G) into k parts V \...., 14 to minimize Ya =i u/4(Vj)) subject 
to the constraint that for 1 < * < k. Si € Vi. Submodular Multiway Partition (Sub-MP) is a gen¬ 
eralization from the setting of graphs to arbitrary submodular functions. Here we are given a non-negative 
submodular function / : 2' —> M + over the ground set V along with terminals {si,..., .sy,} C V. The goal is 
to partition V into Vj ..... 14 to minimize Yli=i f(Vi) subject to the constraint that s* 6 Vi for 1 < i < k. If 
/ is symmetric, as in the case of the undirected graph cut function, we obtain the Symmetric Submodular 
Multiway Partition (Sym-Sub-MP) problem. These problems were considered by Zhao, Nagamochi and 
Ibaraki OTTl who analyzed greedy-splitting algorithms, and more recently by Chekuri and Ene 0 who used a 
Lovasz-extension based convex relaxation. Interestingly, the convex relaxation when specialized to Edge-wt- 
MC yields the CKR-relaxation.Chekuri and Ene 0 obtained a (1.5 — 1 //./-approximation for Sym-Sub-MP 
and 2-approximation for Sub-MP. Ene, Vondrak and Wu lfl3ll improved the bound for Sub-MP to 2(1 — 1/k) 
and also obtained lower bound results in the oracle model. 

Node-wt-MC cannot be viewed as a partition problem directly. Nevertheless, it can be seen that Node- 
wt-MC is equivalent to Hypergraph Multiway Cut problem (Hypergraph-MC) which is a gener¬ 
alization of Edge-wt-MC from graphs to hypergraphs. Hypergraph-MC can be cast as a special case 
of Sub-MP (note that the reduction uses a non-symmetric submodular function /) and thus Node-wt-MC 
can be indirectly reduced to a partition problem. This leads to an alternative 2(1 — 1/k )-approximation for 
Node-wt-MC based on the Lovasz-extension based relaxation for Hypergraph-MC. This relaxation does 
not result in a better worst-case approximation than the distance-based relaxation, however, it appears to be 
strictly stronger in that it improves the approximation ratio in special some cases as observed in 0. No fast 
approximation algorithms are known to solve this convex relaxation. 

Finally we mention the Multicut problem where the goal is to separate a given set of k node-pairs 
(si, ti),..., (sfc, tk) in a given graph at minimum-cost. One can consider undirected graphs with edge weights, 
undirected graphs with node weights and directed graph with edge weights. These versions generalize the cor¬ 
responding multiway cut problems. The best known approximation ratio for Multicut in undirected graphs 
is 0(log k) lfT8l[T7 1 while the best known bounds in directed graphs is min(fc, ()(rk l/23 )) (T). Moreover, it is 
known from the work of Chuzhoy and Khanna iflOll that the problem in directed graphs is inapproximable to a 
factor better than H(2 logl e n ). 
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2 LP Relaxation and rounding for Dir-MC 


Dir-MC can be naturally formulated as an integer linear program with variables x e G {0, 1}, e € E which 
indicate whether e is cut or not. Let Vij be the set of all directed paths from s, to Sj in G. The constraint that s t 
is separated from sj by the cut can be enforced by requiring that ^) egp Xe — 1 f° r eac ^ P e Pij- This leads to 
the following LP relaxation where the integer constraint x e G {0, f} is replaced by x e G [0,1], We can without 
loss of generality drop the constraint x e < 1. 


Dir-MC-Rel 


min 2_ w e x e 



e£E 



J2 X e 

> 1 

p e Vij,i f j 

e£p 



X e 

> 0 

e G E 


Figure 1: LP Relaxation for Dir-MC 

The main result of the paper is the following theorem. 

Theorem 2.1. There is a randomized algorithm that given a feasible solution x to DlR-MC-REL returns a 
feasible integral solution of expected cost at most 2 w e x e , and runs in 0(m + n log n) time. The algorithm 
can be derandomized to yield a deterministic 2-approximation algorithm that runs in 0(m log n) time. Here, 
m = \E(G)\,n = \V(G)\. 

We now describe the simple randomized ball-cutting algorithm that achieves the properties claimed by the 
theorem. Let x be a feasible solution to Dir-MC-Rel. For any two nodes u, v G V we define d x (u, v ) be the 
shortest path length from u to v using edge lengths given by x. For notational simplicity we omit the subscript 
x since there is little chance of confusion. The algorithm adds new nodes t\, £ 2 , ■ ■ ■, £/ c and adds the edge set 
{(£j, Sj) \ i j} and sets the x value of each of these new edges to 0. Note that, this is in effect a reduction of 
the Dir-MC for the given instance to a Dir-Multicut instance which requires us to separate the pair's (t r , sf), 
1 < i < k. The solution x augmented with the extra nodes and edges leads to a feasible fractional solution 
for this Dir-Multicut instance. Our algorithm, formally described below, is very simple. We pick a random 
9 G (0,1) and take the union of the cuts defined by balls of radius 9 around each £j. More formally let B(v, r) 
be the set of all nodes at distance at most r from v. Then the algorithm simply outputs |J - = i <5 + (T>(L- 9)) where 
d + ( A ) denote the set of outgoing edges from A. 


Algorithm 1 Rounding for Dir-MC 
l: Given a feasible solution x to Dir-MC-Rel 

2: Add new vertices t\,.. ., £*,, edges (£», Sj) for all i f j and set x(ti, Sj ) = 0 
3: Pick 9 G (0,1) uniformly at random 
4 : c = uf =1 s+(B(t i ,e)) 

5: Return C 


Note that C is a random set of edges that depends on the choice of 9. We denote by C{9) the set of edges 
output by the algorithm for a given 9. 

Lemma 2.2. If x is a feasible fractional solution to Dir-MC-Rel, C{9) is a feasible multiway cut for 
{.si,..., Sk } for any 9 G (0,1). Thus, Algorithm [7] always returns a feasible integral solution given a fea¬ 
sible x. 
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Proof: Fix any i G {1,..., k} and 9 G (0,1). Since d(B, Sj ) = 0 for all j / i, we have that Sj G B(ti, 9) for 
all j / i. Moreover, by feasibility of x, we have dit,, si) > 1 for otherwise there will be a path of length less 
than 1 from some sj to s, where j f i. Therefore 0 B(ti, 9) because 9 < 1. Therefore, G — d + (B(t,, 9)) has 
no path from Sj to Sj for any j / i. Since C(9) = (J* <5 + (-B(ij, 9)), it follows that there is no path in G — C(8) 
from Sj to Si for any j i. □ 

We now bound the probability that any fixed edge e is cut by the algorithm, that is, Pr[e G C]. Note that 
e may be simultaneously cut by several t t for the same value of 9 but we are only interested in the probability 
that it is included in C. 

Lemma 2.3. For any edge e G E, Pr[e G C] < 2x e . 

Proof: Let e = (u, v). Rename the terminals such that d(si,u) < d{s2, u) < ■ ■ ■ < d(sk,u). This implies that 

d(h,u) = d(s2,u) 


and 

d(t-2, u) = d(ts, u) = ... = d(tk,u) = d(si,u). 

Edge e G 5 + (B(ti, 9)) if and only if 9 G [d(ti, u),d(ti, v))\ we have that d{B,v ) < d(U, u) + x e . Defining the 
interval I t as [d(tj, u), d(ti,u ) + x e ), we see that e G () + [B{t r , 9)) only if 9 G Flowever, from the property 
that d(t 2 ,u) = d(ts,u ) ... = d(tk,u), h = h = • • • = h- Thus, e G C only if 9 G I\ or 9 G h and since |/i| 
and |/ 2 | arc both at most x e long and 9 is chosen uniformly at random from (0,1), 

Pr[e G C] < Pr[0 G Ji] + Pr [9 G I 2 ] < 2x e . 


□ 


Corollary 2.4. E[(7], the expected cost ofC, is at most 2 Yi e w e x e . 


Running time analysis and derandomization: A natural implementation of Algorithm^ would first choose 
9 and then compute 5 + (B{ti, 9)) for each i. This can be easily accomplished via k executions of Dijkstra’s 
single-source shortest path algorithm, one for each B, leading to a running time of (){k{rn + nlogn)) where 
m = \E\ and n = \V\. However, by taking advantage of our analysis in Lemma [273} we can obtain a run time 
that is equivalent to a single execution of Dijkstra’s algorithm. 


Consider a slight variation of Algorithm [I] For each edge e = (u, v), define two intervals h(e) = 
[d(si, it), d(s\,u) + x e ) and h(e) = [d(si, u), d(s\,u) + x e ), where si, s 2 are the two terminals from which 
u is the closest in terms of distance. We pick 9 G (0,1) uniformly at random and include e in C iff 9 G Ji(e) 
or 9 G h(e.)- The analysis in Lemmas 2.2 and 2.3 shows that even this modified algorithm outputs a feasible 
cut whose expected cost is at most 2 Y^ e w e x e . Note that the edges cut by this modified algorithm may be 
a strict superset of the edges cut by Algorithm [T] The advantage of the modified algorithm is that we only 
need to calculate h{e) and / 2 (e) for each edge e G E. To do this, for each node u, we need to find the two 
terminals from which u is the closest and their corresponding distances. More formally, consider the following 
/i-nearest-terminal problem. 


Problem 1. Given a directed graph G = (V, E) with non-negative edge-lengths, a set S C V(G) of k ter¬ 
minals, and an integer h < k, for each vertex v, find the h terminals from which v is the closest among 
the terminals and their corresponding distances. In other words for each v find the h smallest values in 
d(si,v), d(s 2 , v),..., d(sk, v) where S = {si,..., s fc }. 


The above problem can be solved via a randomized algorithm using hashing that runs in expected time 
0(h(m + nlogn)), which corresponds to h executions of Dijkstra’s algorithm. It can also be solved in 
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0{hm log h + hn log' to) time via a deterministic algorithm. See lf2Tl who refers to this as the h-nearest- 
neighbors problem. 

Using the algorithm for the /i-nearest-terminal problem with h = 2, we can calculate I\{e) and / 2 (e) for 
each e G E in 0{m + n log to) timcQ We then chose 6 uniformly at random from (0,1) and cut e if 6 lies 
in one of the range I\ (e) or / 2 (e). This gives us a 2-approximate randomized algorithm with running time 

0 (m + to log n). 

We can derandomize the algorithm by computing the cheapest cut among all 0 e (0,1) as follows. Once 
/i(e) and / 2 (e) are computed for each e we sort the 4 m end points of these 2 m intervals; let them be 0 \ < 62 < 

... < Q\ m - We observe that it suffices to evaluate the cut value at each of these values of 6 . A simple scan of 
these 4m points while updating the cut-value at each end point can be accomplished in 0{m) time. Sorting the 
end points takes 0(m log to) time. This leads to a deterministic 2-approximation algorithm with running time 
0 (m log to). 

2.1 Dir-MC with k = 2 

In this section we address Dir-MC with k = 2 which we refer to as sf-Bi-Cut. We believe this is an interesting 
problem on its own as it is related closely to the classical s-t cut problem. As we remarked earlier, st-Bi-Cut 
is NP-Hard and APX-Hard to approximate. This was shown in ifTTI IT9il via a simple approximation preserving 
reduction from Edge-wt-MC with k = 3. Another consequence of the reduction is that the integrality gap of 
Dir-MC-Rel for sf-Bi-Cut is at least 4/3. On the other hand no ratio better than 2 is known for sf-Bi-Cut. 
This naturally raises the following question. 

Question 1. What is the integrality gap of Dir-MC-Rel for st-Bi-Cut? What is the approximability of 
st-Bi-Cut? 

We obtain two theorems. The first one shows that the integrality gap for sf-Bi-Cut is 2. 

Theorem 2.5. Integrality gap of DlR-MC-REL/or st-Bi-Cut is 2 even in planar directed graphs. 

The second theorem slightly extends a result in llT9ll . 

Theorem 2.6. There is an approximation preserving reduction from 4-terminal NODE-WT-MC to st-Bi-Cut. 
We raise the following question. 

Question 2. Can we prove a factor 2 hardness of approximation for DlR-MC under the assumption that 
P f NP? Does a factor of 2 hardness hold for st-Bi-Cut even under the Unique Games conjecture? 


Integrality gap construction: Proof of Theorem 2.5 is based on recursively defined sequence of graphs 
Gq,Gi, ..., Gh with increasing integrality gap; we will use a* to denote the integrality gap (we also refer to 
this as the flow-cut gap) in G r . The two terminals will be denoted by s, f. The symmetry in the construction 
will ensure that in Gi the s-t cut value will be equal to the t-s cut value; we refer to these common values as 
the one-way cut value and the optimum value of a cut that separates s from t and t from s as the two-way cut 
value. The graph Gq is shown in Fig [2] and it is easy to see that ao = 1. 

The iterative construction of G/+i from G, is shown at a high-level in figure [2] A formal description 
is as follows. To obtain G /+1 with terminals s,f we start with two copies of Gi with terminals si,fi and 
S 2 ,f 2 (denoted by H,H') and two new vertices v\,V 2 - We set s = si, t = f 2 and identify t\ and si as the 


2 One can easily derive the h = 2 case from first principles also. 
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Figure 2: Go on the left and constructing Gj+i from G, shown on the right. 


center vertex v shown in the figure. We add edges (iq. v) and (v,V2) with weight 1 and four other edges 
{(s, ui), (f, v\), (v2, s), (v2,t)} each with weight infinity. Finally we scale the weights of the edges of H and 
H' such that the two-way cut value in each of them is 2 "*. . It is easy to observe inductively that the each graph 
in the sequence is planar and moreover the graph can be embedded such that s and f are on the outer face. The 
analysis of the integrality gap of this construction can be found in the appendix. 

Subsequent to our construction, Julia Chuzhoy obtained an alternative non-recursive construction with an 
integrality gap of 2 for sf-Bi-Cut. 


Reduction from 4-terminal NODE-WT-MC to sf-Bi-Cut: Given a Node-wt-MC instance with graph G 
and set of terminals {si, s 2 , S 3 , S 4 }, Figure [5] shows the ingredients of a reduction to Dir-MC instance with 
graph G' and terminals s, f. This is a slight modification of the reduction from three-terminal Edge-wt-MC to 
sf-Bi-Cut given in fl9l . It is convenient to consider the node-weighted version of Dir-MC which is equivalent 
to the edge-weighted version. Formally G' is obtained from G by the addition of two new nodes s, f which are 
connected to the terminals via directed edges of infinite weight as shown in the figure. Each edge uv £ E(G) 
is replaced by two directed edges (u, v) and (v, u) and the weights of the nodes of G remain the same. We will 
assume without loss of generality that the terminals si, S 2 , S 3 , S 4 have infinite weight. A relatively simple case 
analysis shows that G C V (G) is a feasible node-multiway cut for the terminals { s 1,..., 54} in G iff G is a 
feasible node-multiway cut in G' for {s, f}. This type of reduction does not seem to generalize beyond four 
terminals. 




Figure 3: Reduction from 4-terminal Node-wt-MC to sf-Bi-Cut. Non-terminal vertices are not shown. 


Garg et al. lH9l showed that Dir-MC-Rel does not necessarily have half-integral opitmum solutions. In 
Section [B] we extend their example to show that for every non-negative integer l there exist instances for which 
there is no optimum solution to Dir-MC-Rel that is \/i integral. 









3 LP Relaxation and rounding for Node-wt-MC 


The LP relaxation for the Node-wt-MC is similar to the one for Edge-wt-MC. We have a variable x v £ 
{0,1} for each v £ V which indicates whether to remove v or not. We can assume without loss of generality 
that we cannot remove the terminals s\, S 2 , ■ ■ ■, Sk and moreover that they form an independent set. This can 
be accomplished by adding to each original terminal Sj a new dummy terminal s[ and adding the edge Sjff. Let 
Vij be the set of all paths between .sy and Sj in G. Note that in the undirected graph case we do not need to 
distinguish P tJ from Pj.; . Let S = {si, s 2 , ■ ■ •, s*,} be the set of terminals. 


Node-MC-Rel 



min w v x v 



veV\S 




> 1 

p £ Vij,i < j 

v£p 



x v 

= 0 

v€S 

x v 

> 0 

v £ V 


Figure 4: LP Relaxation for Node-wt-MC 


Theorem 3.1. There is a polynomial-time randomized algorithm that given a feasible solution x to NODE- 
MC-REL returns a feasible integral solution of expected cost at most 2(1 — 1 /k) w v x v , and runs in 0(m + 
n log n) time. The algorithm can be derandomized to yield a deterministic 2-approximation algorithm that runs 
in 0(kn + m + n log n) time. 

Let x be a feasible fractional solution to Node-MC-Rel. For nodes u and v we define d x (n. v ) to be the 
length of the shortest path between u and v according to the node weights given by x; we count the weights of 
the end points u and v in d x (u, v ). We omit the subscript x in subsequent discussion. For a given radius r and 
node u let B(u, r ) be the set of all nodes v such that d(u , v) < r; B(u , r) is the ball of radius r around u. We 
define the “boundary” of radius r from it, denoted by B + (u. r ) to be the set of all nodes that are not in B(u, r) 
but have an edge to some node in B(u, r). 

Proposition 3.2. A node v £ B + (u,r) iff r < d(u,v) < r + x v . Further, if v £ B + (u,r) for r < 1 then 

tXv / 0 . 

Our rounding algorithm first picks an index £ uniformly at random from {1,2 ,k}. It then picks a 6 
uniformly at random from (0,1/2). For each i f £ it includes in the final cut C all nodes v that are in the 
“boundary” of the ball of radius 0 around s*. The formal description is given in Algorithm [2] 

Algorithm 2 Rounding for Node-wt-MC 
l: Given feasible fractional solution x to Node-MC-Rel 
2 : Chose £ £ {1, 2,..., k} uniformly at random 
3: Pick 9 £ (0,1/2) uniformly at random 

4: C = U i#B+( Si ,0) 

5: Return C 


Let C(£. 9) be the output of the algorithm for fixed £ and 9. We first argue that the algorithm always returns 
a feasible multiway cut. 
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Lemma 3.3. For all L 9, C(t, 9) is a feasible multiway cut for the given instance. That is, G — C((, 6) has no 
path from Si to Sj for i j. 

Proof: Consider any pair i, j G {1,2..... k } where i f j. Assume i f l, the case when j f £ is similar. The 
ball B(si, 9) does not contain Sj since 9 < 1/2 and d(si, Sj ) > 1 by feasibility of x. C(£, 9) contains all nodes 
from B + (si, 9), thus, in G — C(£, 9) there cannot be a path from Sj to any node in V \ B(si , 9), and hence to 

Sj. □ 

We say that v is cut by the algorithm if ?; G C. The key to the performance guarantee of the algorithm is 
the following lemma. 

Lemma 3.4. Pr[n G C\ < 2(f — l/k)x v . 

Proof: Fix a node v and rename the terminals such that d(si,v) < d(s 2 ,v) < ■ ■ ■ < d(s k ,v). Define the 
interval /, as [d(si, v) — x v . min(d(sj, v), 1/2)). From the algorithm description and Proposition |3.2| we can 
see that v G C iff 3i such that if- i and 9 G f. 

Note that f is an empty interval if x v = 0 or d(s\,v) — x v > 1/2. Hence we can assume that x v > 0 and 
d(s\, v) — x v < 1/2, otherwise /, is empty for all z and Pr [v G C] = 0. We now consider two cases depending 
on whether d(s 2 , v ) — x v is greater than 1/2 or not. 

First, consider the case when d(s 2 , v) — x v > 1/2. Interval I 2 is empty. Since d(s 2 , v) < d(ss,v) < • • • < 
d(sk, v), intervals I 3 , 14 ,..., //,. arc also empty. Hence, v G C iff £ f 1 and 0 G I \. Interval Ii has length at 
most x v and 9 is chosen uniformly at random from (0,1/2). Therefore, 

Pr[n G C] = Fi[£ f 1] Pr [9 G /,] < (l - \) ■ 2x v . 

rv 

In the preceding equation we used independence in the choice of £ and 9. 

Next, consider the case when d(s 2 , v) — x v < 1/2. From the feasibility of x, we have that d(s\,v) — x v + 
d(s 2 , v) > 1 (recall that d(si,v) and d(s 2 , v) include the length of x v ). This implies that d(s 1 , n) > 1/2. Since, 
d(si,v) > d(s 1 , n) for all i, we have d(si,v ) >1/2 which implies that for all i, f = [d(si,v) — x V: 1/2). Easy 
to see that Ij D I 2 • ■ O I k . Therefore, v G C iff £ = 1 and 9 G I 2 or £ 7 ^ 1 and 9 G I\. Length of interval Ii 
and I 2 are 1/2 — d(s\, v) + x v and 1/2 — d(s 2 , v) + x v respectively. Hence, 

Pr[n G C] = Pr[£ = 1] Pr[0 G I 2 ] + Fv[£ f 1] Pr[0 G h] 

= l/k ■ 2(1/2 - d(s 2 ,v) + x v ) + (1 — 1/k) ■ 2(1/2 - d(si,v) + x v ) 

< 2(1 - l/fc)(l — d(si,v) — d(s 2 ,v) + 2x v ) 

< 2(1-1 /k)x v 

In the penultimate inequality above, we use the fact that 1 — 1/A; > 1/ArifA: > 2. The final inequality follows 
from already stated observation, d(si, v ) + d(s 2 ,v) — x v > 1 due to feasibility of x. □ 

Corollary 3.5. E[u>(C)] = Ylvev Wv ^ 1 \. v £ C] < 2(1 — l/k)'f2 v w v x v - Thus, the expected cost of the cut 
output by the algorithm is at most 2(1 — l/k) times the cost of the fractional solution x. 


Running time: Algorithm [2] can be implemented in 0(m + n log n) time, in a fashion very similar to the 
implementation of the modified version of Algorithm [I] First, we pick £ uniformly at random from {1..... /,:} 
and 9 uniformly at random from (0.1/2). Then, for each vertex v we find the closest terminal s in the set 
S \ {sp} and cut vertex v if d(s, v) — x v < 9 < d(s, v). Finding nearest terminal for each vertex can be done 
in 0(m + nlogn) time. Hence, we get a randomized 2(1 — 1 /A:) -approximation rounding scheme in time 
0(m + nlogn). 


To derandomize, we consider for each v intervals I\ (v) and I 2 {v) as in the proof of Lemma 3.4 Using the 
/i-nearest terminal algorithm for h = 2 with S as the set of terminals, in 0(m + n log n) time, we can compute 
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h(v) and I-ziy ) for all v. We sort the 4n end points of these 2 n intervals and let them be 9\, 62 , ■ ■ ■, 0^ n . It 
suffices to find the cost of the cut for each 9 from this 4n values and for each £ e {1,2,..., k}. We process 
these sorted values in order and for each 9, we calculate w(C(£, 9)) for all The proof of Lemma 3.4 shows 
that this can be done by using only I\ (v) and J>(v) for all v. As we process the end points in the sorted order 
the time to update the cut for each £ per end point is 0(1). Thus, in 0(nk + m + n log n) time we can obtain a 
deterministic algorithm that gives a 2(1 — 1 /A:)-approximation. 


Acknowledgments: CC thanks Sudeep Kamath, Sreeram Kannan and Pramod Viswanath for extensive dis¬ 
cussions on the problems considered in fU which inspired us to revisit the rounding schemes for multiway cut 
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A Proof of Theorem 12.51 


Here we prove the correctness of the integrality gap construction described in Section 2.1 


The following proposition is easy to establish based on the symmetry in the construction of the graphs. 
Proposition A.l. The s-t cut value and the t-s cut value in G l+ \ are the same. 


Now, we calculate ay+i in terms of a 1: . We refer to the copy of G, containing s and v with scaled capacities 
as H, and the one containing v and t as II'. 

Lemma A.2. For i > 0, ay+i = ■ For i > 0, the ratio of of the one-way cut value to the two-way cut value 

in Gi is 

Proof: Proof by induction on i. For the base case we see that op = 1 and in Go the one-way cut value and 
two-way cut value are both 1 and hence the ratio is equal to 1 = ^. 

We now prove the induction step. For this purpose we estimate the one-way cut value and the two-way cut 
value in G l+ \. 

Minimum two-way cut: Any finite value cut that separates s from t has to cut at least one of the two edges 
(vi,v), ( v , V 2 )- We consider two cases. 

Case 1: Both (iq, v), ( v, V 2 ) are cut. To separate s and t it is best to pick a two-way cut between s and v in H 
(or symmetrically between v and t in H'). Thus the total cost is 2 + . 

Case 2: Only one of the edges (ni,n), ( 0 , 02 ) is cut. Without loss of generality this edge is (v,V 2 ). Since 
(v\,v) is not cut s and t can reach v via v\. Thus any two-way cut in G needs to use a one-way cut in FI to 
separate v from s and a one-way cut in IF to separate v from t. The cost of each of these one-way cuts is, by 
induction, — • 75 -^— = 7 ^—. Thus the total cost is 1 + 75 -^— = . 

’ cni 2 — ot-i 2—a.i 2—oti 2—oti 

In both cases the cost is the same and hence the optimal two-way cut in G /+1 is 

Minimum one-way cut: We now calculate one-way cut from s to t. At least one of the edges (vi, v), ( v , V 2 ) 
has to be cut. Also, either there is no path from s to v or no path from v to t. Thus, the cost of the one-way cut 
from s to t is at least 1 + = 1 'Erx- Moreover it is easy to see that this is achievable by removing (y\,v) 

and one-way cut from s to v in H. 

Optimum fractional solution value: We now calculate the optimum for Dir-MC-Rel on G l+ ). We consider 
the following feasible solution x. Assign 0 to the infinite weight edges and 1/2 to each of edges {v\,v) and 
[v,V 2 )- For the edges in the graphs H and H' we take an optimum solution y to Dir-MC-Rel on (7; and 
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scale it down by 1/2 and assign these values to the edges of H and H'. Feasibility of y for Gy implies that 
distance from s to v and v to s in H according to x is 1/2 (since we scaled down by 1/2). It is easy to verify 
that distance of s to t and from t to s is 1 in the fractional solution x in Gi + \. Now we analyze the cost of this 
solution YleeE(G + 1 ) w e x e- We have a total contribution of 1 from the two edges (v \, v) and (v, vi )■ We claim 
that J2e£E(H) w e%e = \ ■ since the cost of the two-way cut in H is chosen to be the integrality 

gap is a* and we scaled down y by 1/2 to obtain x in H. Same holds for H'. Thus the total fractional cost of 
this solution is 1 + 2 ^ \y c can see that this is an optimum solution by exhibiting a multicommodity 

flow of the same value for the pairs (s, t ) and (t, s ) in G 1+ \. Route one unit of flow from s to t along the path 
s —> v\ —> v —> V 2 —> t. In H there exists a feasible flow of total value — • 7 ^— = . Let f(s,v) and 

f(v, s ) be the amount of flow from s to v and v to s respectively. By duplicating this flow in H’ we see that 
a flow of value exists between s and t in G / + 1 via H and H'. Thus there is a total flow of value at least 
1 + 2 ^a~ ^*+1 anc ^ i s optimal. 

We can now put together the preceding bounds to prove the lemma. The flow-cut gap in G/ + i is seen to be 
the ration of the two-way cut value and the maximum flow value / . Hence ctj+i = / /' as desired. 

The ratio of one-way cut value ar *d the two-way cut value in Gi + 1 is which is equal to ^ 7 - 
This completes the inductive proof. □ 

We have a sequence of numbers ay where ao = 1 and a^+i = It is easy to argue that this sequence 

converges to 2. This proves that the integrality gap of Dir-MC-Rel is in the limit equal to 2. 


B Fractionality of the LP solutions 

It was shown in llTflll that there is a half-integral optimum solution for the natural LP relaxation for node- 
weighted multiway cut (Node-wt-MC) which was then exploited to obtain a 2(1 — l//c)-approximation. lfl9ll 
also showed that the half-integral property does not hold for .st-Bi-Cut. Here we generalize their example to 
observe that for any positive integer i there are examples where there may not exist an optimum solution to 
Dir-MC-Rel on instances with two terminals that is 1 /l integral. More generally, there does not exists an 
edge with length more than 1 /i. 

Consider the generalization of the example in llT9l as shown in Fig [5] Each flow path from s to t or t to s 
has to use at least h edges of the type (rq, Uj+i) or (vj,Vj+ 1 ). Since, there are only 2 (h — 1) such edges, flow 
is upper bounded by 2 (h — 1 )/h. To see that this flow is also achievable, consider the following sets of paths. 
For 1 < i < h — 1, path Pi = s, u \,..., rq + i, ly ,... ,Vh,t and path P- = t,v 1 , • • •, ry+i, Ui ,..., Uh, s. Send 
1 jh unit of flow along each of these paths. Each of the edge (uj,u,j+ 1 ) is part of Pi for i > j and part of P\ for 
i < h — j. Hence, capacity used for edge is h ■ 1/h = 1. Similarly for each edge (?y, ry+i). Flow 

value is equal to 2 (h — 1 )/h. So, optimum solution has value 2(h — l)/h. 

Ul Vi 


t 



Figure 5: Edges of the form (m, u l+ 1 ) or (vj. v 3+ \ ) have capacity 1 and rest have infinite capacity. Optimal 
fractional cut/flow is 2(1 — 1 /h). 

By strong duality, optimal value of Dir-MC-Rel is equal to maximum flow which is equal to 2 (h — 1 )/h. 
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Let x be an optimal solution to the Dir-MC-Rel. By feasibility of the solution, each of the paths P[ and P- has 
length at least 1. Summing up the lengths of path Pi and P[, we get uj l+ \) + x(vj, Vj+i ))'j + 

x(ui,Ui + 1 ) + x(vi,Vi + 1 ) > 2. By optimality of the solution first term is equal to 2 (h — 1 )/h. Therefore, 
x(ui , Ui + 1 ) + x(yi , Vi + 1 ) > 2/h. Since, this inequality holds for all 1 < i < h — 1, and ^2jZl(x(uj, Uj + 1 ) + 
x(vj,Vj + 1 )) = 2 {h — 1 )/h, we get that all the inequalities are tight and x(ui,Ui + 1 ) + x(vi,Vi + 1 ) = 2 /h. 
Since, all lengths are non-negative, x(ui,Ui + i),x(vi,Vi + i) < 2/h. By taking h > 2i, we get an instance 
where optimal solution has no edge having length at least 1 /L 
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